Search CORE

381 research outputs found

Multi-task learning for intelligent data processing in granular computing context

Author: Cocea Mihaela
Ding Weili
Liu Han
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2018
Field of study

Classification is a popular task in many application areas, such as decision making, rating, sentiment analysis and pattern recognition. In the recent years, due to the vast and rapid increase in the size of data, classification has been mainly undertaken in the way of supervised machine learning. In this context, a classification task involves data labelling, feature extraction,feature selection and learning of classifiers. In traditional machine learning, data is usually single-labelled by experts, i.e., each instance is only assigned one class label, since experts assume that different classes are mutually exclusive and each instance is clear-cut. However, the above assumption does not always hold in real applications. For example, in the context of emotion detection, there could be more than one emotion identified from the same person. On the other hand, feature selection has typically been done by evaluating feature subsets in terms of their relevance to all the classes. However, it is possible that a feature is only relevant to one class, but is irrelevant to all the other classes. Based on the above argumentation on data labelling and feature selection, we propose in this paper a framework of multi-task learning. In particular, we consider traditional machine learning to be single task learning, and argue the necessity to turn it into multi-task learning to allow an instance to belong to more than one class (i.e., multi-task classification) and to achieve class specific feature selection (i.e.,multi-task feature selection). Moreover, we report two experimental studies in terms of fuzzy multi-task classification and rule learning based multi-task feature selection. The results show empirically that it is necessary to undertake multi-task learning for both classification and feature selection

Crossref

Online Research @ Cardiff

Portsmouth University Research Portal (Pure)

Dielectric response of soft mode in ferroelectric SrTiO3

Author: Han Jiaguang
Wan Fan
Zhang Weili
Zhu Zhiyuan
Publication venue: 'AIP Publishing'
Publication date: 12/12/2006
Field of study

We report far-infrared dielectric properties of powder form ferroelectric SrTiO3. Terahertz time-domain spectroscopy (THz-TDS) measurement reveals that the low-frequency dielectric response of SrTiO3 is a consequence of the lowest transverse optical (TO) soft mode TO1 at 2.70 THz (90.0 1/cm), which is directly verified by Raman spectroscopy. This result provides a better understanding of the relation of low-frequency dielectric function with the optical phonon soft mode for ferroelectric materials. Combining THz-TDS with Raman spectra, the overall low-frequency optical phonon response of SrTiO3 is presented in an extended spectral range from 6.7 1/cm to 1000.0 1/cm.Comment: 14 pages; 4 figure

arXiv.org e-Print Archive

Crossref

SHAREOK repository

SoK: Training Machine Learning Models over Multiple Sources with Privacy Preservation

Author: Han Weili
Ruan Wenqiang
Song Lushan
Wu Haoqi
Publication venue
Publication date: 06/12/2020
Field of study

Nowadays, gathering high-quality training data from multiple data controllers with privacy preservation is a key challenge to train high-quality machine learning models. The potential solutions could dramatically break the barriers among isolated data corpus, and consequently enlarge the range of data available for processing. To this end, both academia researchers and industrial vendors are recently strongly motivated to propose two main-stream folders of solutions: 1) Secure Multi-party Learning (MPL for short); and 2) Federated Learning (FL for short). These two solutions have their advantages and limitations when we evaluate them from privacy preservation, ways of communication, communication overhead, format of data, the accuracy of trained models, and application scenarios. Motivated to demonstrate the research progress and discuss the insights on the future directions, we thoroughly investigate these protocols and frameworks of both MPL and FL. At first, we define the problem of training machine learning models over multiple data sources with privacy-preserving (TMMPP for short). Then, we compare the recent studies of TMMPP from the aspects of the technical routes, parties supported, data partitioning, threat model, and supported machine learning models, to show the advantages and limitations. Next, we introduce the state-of-the-art platforms which support online training over multiple data sources. Finally, we discuss the potential directions to resolve the problem of TMMPP.Comment: 17 pages, 4 figure

arXiv.org e-Print Archive

Human posture recognition based on multiple features and rule learning

Author: Ding Weili
Hu Bo
Huang Xiangsheng
Liu Han
Wang Xinming
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/11/2020
Field of study

The use of skeleton data for human posture recognition is a key research topic in the human-computer interaction field. To improve the accuracy of human posture recognition, a new algorithm based on multiple features and rule learning is proposed in this paper. Firstly, a 219-dimensional vector that includes angle features and distance features is defined. Specifically, the angle and distance features are defined in terms of the local relationship between joints and the global spatial location of joints. Then, during human posture classification, the rule learning method is used together with the Bagging and random sub-Weili Ding space methods to create different samples and features for improved classification of sub-classifiers for different samples. Finally, the performance of our proposed algorithm is evaluated on four human posture datasets. The experimental results show that our algorithm can recognize many kinds of human postures effectively, and the results obtained by the rule-based learning method are of higher interpretability than those by traditional machine learning methods and CNNs

Online Research @ Cardiff